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ENGINEERED PROTEASES FOR AFFINirY PURIFICATION AND PROCESSING OF 

FUSION PROTEINS 



BACKGROUND OF THE INVENTION 

Field of the Invention 



The present invention relates to purification methods, and more particularly, to a fusion protein 
10 comprising a target protein and a protease prodomain protein wherein the prodomain protein 
has higji affinity for binding with a corresponding protease or variant thereof to provide a 
protease binding complex for subsequent recovery of the target protein. 

Description of Related Art 

15 

Recombinant DNA techniques have facilitated the e3q>ression of proteins for diverse 
applications in medicine and biotechnology. However, the purification of recombinant 
proteins is often complicated and problematic. The large-scale, economic pxirification of 
proteins generally includes producing proteins by cell culture, such as bacterial cell lines 
20 engineered to produce the protein of interest by insertion of a recombinant plasmid containmg 
the gene for that protein. Separation of the desired protein fi-om the mixture of compounds fed 
to the cells and from the by-products of the cells themselves to a purity suflEicient for use as a 
human therapeutic poses a formidable challenge. 

25 Procedures for purification of proteins from cell debris initially depend on the site of 
expression of the protein. Some proteins can be caused to be secreted directly from the cell 
into the surrounding growth media; others are made intracellularly. For the latter proteins, the 
first step of a purification process involves lysis of the cell, which can be done by a variety of 
methods, including mechanical shear, osmotic shock, or enzymatic treatments. Such 

30 disruption releases the entire contents of the cell into the homogenate, and in addition produces 
subcellular fragments that are difficult to remove due to their small size. These are generally 
removed by differential centrifiigation or by filtration. The same problem arises, although on a 
smaller scale, with directly secreted proteins due to the natural death of cells and release of 
intracellular host cell proteins in the course of the protein production run. 
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Once a clarified solution containing the protein of interest has been obtained, its separation 
from the other proteins produced by the cell is usually attempted using a combination of 
different techniques. As part of the overall recovery process for the protein, the protein may 
5 be exposed to an immobilized reagent, which binds to the protein. 

Proteomics initiatives of the post genomic era have greatly increased the demand for rapid, 
effective and standardized procedures for the purification and analysis of proteins. For 
example, recombinant proteins are firequenfly fiised with other proteins or peptides to facilitate 
10 purification. The fused domain serves as a temporary hook for affinity purification and 
ultimately must be cleaved off by site-specific proteolysis- A number of fusion protein 
systems using different carrier proteins are now commercially available, particxilarly for E, coli 
expression. Exannples include maltose binding protein, glutathione S-transferase, biotin 
carboxyl carrier protein, thioredoxin and cellulose binding domain. 

15 

Fusion protein expression simplifies the separation of recombinant protein fi-om cell extracts 
by affinity chromatography using an immobilized, moderate-affinity ligand specific to the 
carrier protein. However, typically, immobilization requires the covalent attachment of the 
hgand to the matrix resulting, in many cases, in loss of activity. A typical example of a widely 
20 used product is Protein A-Sepharose. This highly expensive product is used for the 
purification of IgG by affinity chromatography, as well as for many diagnostic protocols. 

Thus, more economical and technically simple methods for purification of soluble proteins, 
which do not involve scale-up of chromatographic procedures, are therefore desirable. 

The function of proteases range from broad specificity, degradative enzymes to highly 
sequence specific enzymes that regulate physiological processes from embryonic development 
to cell death. Some high specificity proteases have been recruited from nature to serve as tools 
for the purification and analysis of proteins in a manner somewhat analogous to use of 
restriction endonucleases to manipulate DNA. The specific processing enzymes currently 
available are from mammalian sources, such as thrombin, factor Xa and Enteropeptidase. 
However, although widely used in protein work fliese natural enzymes are very expensive and 
of low stability limiting their usefulness for many applications. 
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Considerable eflfort has been devoted to engineering robust, bacterial proteases, such as 
subtilisin, to cleave defined sequences. Subtilisin is a serine protease produced by Gram- 
positive bacteria or by fungi. Subtilisins are important industrial enzymes as well as models 
for understanding the enormous rate enhancements affected by enzymes. The amino acid 
5 sequences of numerous subtilisins are known and include subtilisins from Bacillus strains, for 
example, subtilisin BPN', subtilisin Carlsberg, subtilisin DY, subtilisin amylosacchariticus, 
and mesenticopeptidase. For these reasons along with the timely cloning of the gene, ease of 
expression and pxirification and availability of atomic resolution structures, subtilisin became a 
model system for protein engineering studies in the 1980*s. Fifteen years later, mutations in 
10 well over 50% of the 275 amino acids of subtilisin have been reported in the scientific 
literature. Most subtilisin engineering has involved catalytic amino acids, substrate binding 
regions and stabilizing mutations. The most mutagenized subtilisins [1,2] are those secreted 
from the Bacillus species amyloliquefaciens (BPISP), subtilis (subtflisin E) and lentus 
(Savinase). 

15 

In spite of the intense activity in protein engineering of subtilisin it previously has not been 
possible to transform it from a protease with broad substrate preferences into an enzyme 
suitable for processing specific substrates th^eby rendering it useful for protein recovery 
systems. Thus, it would be extremely useful for research and protein purification to be able to 
20 use low specificity proteases such as subtilisin for purification processes. 

SUMMARY OF THE INVENTION 

The invention relates to the discovery that subtilisin and variants thereof are useful in the 
25 purification of proteins when used with a substrate sequence of high affinity for the protease, 
wherein the substrate sequence is preferably the prodomain of subtilisin. Also, disclosed is the 
construction of an expression system for the production of a fusion protein comprising the 
prodomain of subtilisin and a second protein of interest. 

30 Secreted proteases, such as subtilisin, are synthesized as inactive zymogen precursors in order 
to tightly regulate the timing of protease activation [159]. Frequently, the zymogen precursor 
consists of N-terminal amino acids attached to the mature protease sequence. A number of 
these N-terminal extensions (prodomains) are large enough to fold independency and have 
been shown to bind tightly to the active site of the mature protease[149,160-166]- 
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Subtilisin BPN is an extracellular serine proteinase from Bacillus axnyloliquefaciens having a 
primary translation product which is a pre-pro-protein [9,10]. A 30 amino acid pre-sequence 
(SEQ ID NO. 2) serves as a signal peptide for protein secretion across the membrane and is 
5 hydrolyzed by a signal peptidase [167], The extracellular part of the maturation process 
involves folding of prosubtilisin, self-processing of a 77 amino acid sequence (SEQ ID NO. 1), 
to produce a processed complex and finally degradation of the prodonmn to create the 275 
amino acid (SEQ ID NO. 3) mature SET sequence. The 77 amino acid prodomain is removed 
autocatal3rtically and it has been suggested that the prodomain delays the activation of 
10 subtilisin until after secretion from Bacillus [168] because the prodomain is a competitive 
inhibitor of the active subtilisin (Kj of 5.4 x 10"^ M) exhibiting a strong inhibition of the 

activity of the subtilisin. 

Subtilisin's broad preferences result from the manner in which it binds to protein substrates. 

15 Most subtilisin contacts are with the first four amino acids on the acyl side of the scissile bond 
located in the substrate structure. These residues are denoted PI througji P4, numbering from 
the scissile bond toward the N-terminus of the substrate [157], The side chain components of 
substrate binding result primarily from the PI and P4 amino acids [193] [46,47]. Subtilisin 
prefers hydrophobic amino acids at these positions. A high resolution structure of a complex 

20 between subtilisin and prodomain shows that flie C-terminal portion of the prodomain binds as 
a substrate into the subtilisin active site and that the globular part of the prodomain has an 
extensive complementary surface to subtilisin. The C-terminal residues extend out from the 
central part of the prodomain and bind in a substrate-like manner along SBT's active site cleft 
Thus, residues Y77, A76, H75 and A74 of the prodomain act as PI to P4 substrate amino 

25 acids, respectively. These residues conform to subtilisin's natural sequence preferences. The 
folded prodomain has shape complementary and high affinity to native subtilisin mediated by 
both the substrate interactions of the C-terminal tail and a hydrophobic interface provided by 
tlae p- dieet [133], 

30 Likewise, sequencing of the gene for alkaline phosphatase (ALP) revealed that ALP is also 
synthesized as a pro-enzyme. In ALP, the prodomain (166 amino acids) is almost as large as 
the mature protease (198 amino acids). Further, it was demonstrated that the ALP prodomain 
is required to produce active ALP in vivo and that the 166 amino acid prodomain was a strong 
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competitive inhibitor of ALP [170]. Interestingly, structure analysis of the ALP wiA its 
prodomain revealed an affinity binding of the prodomain to the active site of ALP [1 87]. 

Other examples of prodomain mediated folding have been found in all four mechanistic 
5 families of proteases: serine proteases [172-177]; Aspartic proteases [178-180]; 
metalloproteases [181-18S] and cysteine proteases [186]. 

Thus, in one aspect the present invention relates to protein pmification processes using a 
prodomain protein linked to a target protein wherein the prodomain protein has a high afGnity 
10 for the normally associated protease thereby providing for easy separation of the target protein 
from the prodomain. Preferably, the present invention relates to the prodomain of secreted 
proteases such as subtilisin or variants thereof, 'wherein the prodomain has a high affinity for 
the subtilisin or variants thereof 

15 In another aspect, the present invention relates to a fusion protein comprising a protease 
prodomain fused to target protein, wherein cleavage is directed specifically to the peptide bond 
joining the prodomain and the target protein and wherein the prodomain has a high affinity 
binding for the corresponding protease. Preferably, the protease is subtilisin or a variant 
thereof, wherein the variant is modified to specifically hydrolyze the peptide bond between the 

20 protease prodomain and a target protein and/or whose hydrolytic activity may be triggered by 
specific ions. Additionally, the prodomain protein may be optimized by including cognate 
sequences for the protease. 

In yet another aspect, the present invention comprises a prodomain protein of amino acid 
25 sequence SEQ ID NO. 1 fused to a protein of interest. Further the prodomain sequence may 



comprise substitutions in at least the PI - P4 amino acid residues including the following: 



Prodomain 


P4 


P3 


P2 


PI 


Wild-type 
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H 


A 


Y 


Substitutions 


ForY 


any 


AorS 


M,Y,F,HorL 



Several cognate sequences have been found to be highly effective including FKAM, FKAY or 
FKAF. Surprising the addition of the sequences FKAM, FKAY or FKAF also increase the 
30 affinity of the prodomain to the subtilisin to > lO^-M'^ 
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Additionally, the subtilisin prodomain may further include stabilizing mutations to further 
increase its affinity for subtilisin. Still further, mutations may be incorporated into one or 
more of the four catalytic amino acids of subtilisin to drastically reduce its proteolysis of non- 
specific amino acid sequences. Preferred mutations are included at amino acid positions 32, 
5 64, 155 and 221 of the subtilisin sequence identified as SEQ ID NO. 3 and shown in Figure 2. 

Thus, in another aspect, the present invention provide for a processing protease having a Km 
for a cognate sequence in tiie prodomain that is < 1 nm. The kcat of a processing protease is in 
the range of 10** sec"* to 10'^ scc"^ Thus the tumover nimiber (kcat/KnO for the processing 
10 protease and its cognate prodomain substrate is in the range of 10"* M"* s"* to 10^ M"* s'* while 
tumover number vs. a non-specific sequence is < 1 M"* s'\ 

A preferred processing enzyme would prefer its cognate prodomain by > 10^ fold over a non- 
specific sequence. The most preferred embodiments of the invention are processing subtilisins 
15 that have kcat values in the range of 0.001 to 0.0001 s*^ Subtilisins that cleave in this time 
range process the substrate slowly enough to allow affinity purification of any protein 
containing the cognate prodomain as an N-terminal fusion domain. 

In another aspect the present invention provides for a fusion protein comprising a target 
20 protein linked to a domain, wherein the domain protein includes amino acid residues on the C- 
terminal comprising a variant of (E E D K L (FAT) Q S (M/LAT), wherein the C-terminal part 
of the domain causes an affinity for subtilisin or variants thereof. 

In yet another aspect, the present invention provides for a method of generating a subtilisin 
25 prodomain fiision product. An exemplary procedure comprises the following steps: 

providing nucleic acid encoding the subtilisin prodomain fusion protein wherein the 
fusion protein comprises a prodomain of subtilisin or variant thereof and a second protein of 
interest, the prodomain being capable of binding subtilisin or variant thereof with high affinity; 
transfecting a host cell with the nucleic acid or using an equivalent means for 
30 introducing the nucleic acid into the host cell; and 

culturing the transformed host cell under conditions suitable for expression of the 
fusion protein. 

The subject fusion protein will generally be produced by recombinant methods, in particular 
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and preferably by expression of a subtilisin prodomain/second protein DNA wherein the DNA 
will be expressed in microbial host cells, in particular Bacillus subtilis, because this bacteria 
naturally produces subtilisin, is an efficient secretor of proteins, and is able to produce the 
prodomain protein in an active conformation. However, the invention is not restricted to the 
expression of the fusion protein in Bacillus, but rather embraces expression in any host cell 
that provides for e^qjression of the fusion protein. Suitable host cells for expression are well 
known in the art and include, e.g., bacterial host cells such as Escherichia coli. Bacillus, 
Salmonella, Pseudomonas; yeast cells such as Saccharomyces cerevisiae, Pichia pastoris, 
Kluveromyces, Candida, Schizosaccharomyces; and mammalian host cells such as CHO cells. 
Bacterial host cells, however, are the preferred host cells for expression. 

Expression of the DNA encoding the subtilisin prodomain/second protein fusion protein may 
use available vectors and regulatory sequences. The actual selection will depend in a large 
part upon the particular host cells that are utilized for expression. For example, if the fusion 
protein is expressed in Bacillus, a Bacillus promoter will generally be utilized as well as a 
Bacillus derived vector. Expression of the fusion protein in microbial host cells will generally 
be preferred since this will allow for the microbial host cell to produce the subtilisin 
prodomain in a proper conformation. 

A further aspect of the present invention relates to a method for purifying a protein of interest 
from a fusion protein and separation therefrom, the method comprising: 

contacting a fusion protein con^rising a prodomain protein linked to the 
protein of interest with an effective amount of subtilisin or variant thereof xmder 
conditions suitable for the formation of a binding complex between the subtilisin or 
variant thereof and the prodomain protein of the fusion protein; 

incubating the binding complex for a sufficient time for the subtihsin or 
variant thereof to cleave the protein of interest from the binding complex; and 
recovering the protein of interest 

Preferably, the protease has been modified to specifically bind to the protease prodomain 
fusion protein and the protease prodomain protein has been modified to included cognate 
sequences of the protease for autocatalytic removal of the second protein from the binding 
complex. More preferably, the protease is subtilisin or a variant thereof and the prodomain has 
a hi^ binding affinity for such protease. 
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Yet another aspect of the present invention provides nucleic acid encoding a fusion protein 
comprising a protease prodomain protein and a second target protein including a cleavage site 
positioned tiierebetween. Preferably, the cleavage site is upstream of the N-terminal amino 
acid of the second protein of the fusion product. More preferably, the cleavage site is 
downstream from the P4 - PI amino acid residues. 

Cleavage point 
I 

Prodomain P4 P3 P2 PI TARGET PROTEIN 

Another aspect of the present invention provides for a host cell comprised of nucleic acid 
encoding a protease prodomain fusion protein of the present invention. 

An additional aspect of the present invention relates to a diagnostic kit for the detection of a 
substance of interest comprising: 

(a) a protease prodomain fusion protein comprising: 

(i) a protease prodomain capable of binding to a subtiUsin or variant thereof 
with high affinity; and 

(ii) a second protein capable of binding a substance of interest; 

(b) a detectable label; and 

(c) a subtilisin or variant thereof for binding to the protease,prodomain fusion protein. 

Preferably, the prodomain is a subtiUsin prodomain and the second protein may include, but is 
not limited to an enzyme, hormone, antigen, or antibody. 

In another aspect, the present invention relates to an assay method for using the above 
described diagnostic kit for detecting the presence of a substance of interest in a test sample 
comprising: 

(a) incubating a test sample, which may contain a substance of interest, with a 
sufficient amount of a protease prodomain fusion protein, wherein the protease 
prodomain fusion protein comprises: 

(i) a protease prodomain capable of binding with high affinity to a subtilisin or 
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variant thereof, and 

(ii) a second protein capable of binding the substance of interest, 
wherein the incubating conditions permit the binding of the substance of interest to the 
' second protein; 

5 (b) contacting the protease prodomain fusion protein used in step (a) to subtilisin or a 

variant thereof, wherein the subtilisin or a variant thereof is in solution in an amount 
effective to bind the fusion protein and form a binding complex or innmobilized on a 
solid phase to form a subtilisin/prodomain fusion protein binding complex; 

(c) incubating the subtilisin/prodomain fusion protein binding complex for a sufficient 
10 time for autocatalytic cleavage of the second protein from the binding complex; 

(d) recovering the second protein bound to the substance of interest. 

This embodiment further provides for introducing a detectable label wherein the label is 
capable of binding to the substance of interest and determining the presence or absence of the 
15 label, to provide an indication of the presence or absence of the substance of interest in the test 
sample. The detectable label may be introduced either before separation of the second protein 
from the binding complex or after the second protein is recovered. 

The test sanqjle may be a bodily fluid, including, but not limited to, blood, urine, semen, 
20 saliva, mucus, tears, vaginal secretions, and the like. 

In a specific embodiment of the present invention, the method is designed for the detection of 
a specific protein or peptide in a testing sample, thus, the second protein of the prodomain 
subtilisin fusion protein may be an antibody against the specific protein or peptide in the 
25 testing sample. The antibody may be a monoclonal antibody or a polyclonal antibody. The 
subtilisin prodomain of the present invention may be conjugated to the antibo4y either directly 
or through a linker moiety. 

The substance of interest may also comprise a biotinylated probe bound to a protein, peptide, 
30 hormone, nucleic acid or other probe-targetable molecule. The label may include an en2yme, 
that upon adding a sufficient amoimt of a substrate for the enzyme, the substrate is conVCTted 
by the enzyme to a detectable compound. 

Finally, it is a further aspect of the present invention to provide a drug delivery system 
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comprising a subtilisin prodomain protein associated with a therapeutic compound or drug of 
interest to form a fusion product, wherein the fusion product is further complexed to a 
subtiHsin or variant thereof to form a drug delivery complex. In such a drug delivery system, 
the drug of interest may be conjugated to the subtilisin prodonaain either directly or through a 
5 linker moiety. Many methods of conjugation exist and are known in the art. For sample, 
acyl activation agents exist, such as cyclohexylcarbodiimide, which can be used to form amide 
or ester bonds. 

In one embodiment such a drug delivery system can be a slow or sustained drug delivery 
10 system wherein the drug of interest is slowly released from the subtilisin prodomain bound to 
subtilisin. It is contemplated that such a drug delivery system can be incorporated into a 
composition that can be administered parenterally, orally, topically or by inhalation. 
Furthermore, the composition may be in the form of a solid, gel, liquid or aerosol. 

15 Other features and advantages of the invention will be apparent from the following detailed 
description, drawings and claims. 

BRIEF DESCRIPTION OF THE FIGURES 

20 Figure 1 illustrates a ribbon drawing depicting the a-carbon backbone of subtilisin in complex 
with its prodomain. 

Figure 2 shows the amino acid sequence of Subtilisin BPN' wild type. 

25 Figure 3 shows Table 1 setting forth mutations introduced to subtilisin 'BPN. 

Figure 4 shows the rate of protease processing proportional to the concentration of ions. 

Figure 5 shows that the rate of binding of processing subtihsin (SI 89) to the prodomain is 
30 rapid. 

Figure 6 shows that SI 89 cleaves with a half-time of about four hours. A lag phase is evident. 
This lag is useful for protein purification to allow contaminants to be washed away before 
significant cleavage has occurred. 

10 
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Figure 7 shows the results of purification of a fusion protein comprising prSFICAM-Protein G 
with immobilized substrate subtilisin SI 89 or 190 wherein the blot lanes are assigned as 
follows: 

Lane 1: Molecular weight standards - 2pg band 

Lane 2: cell lysate - 10)il of 10 ml from 250 ml culture of 671 prSFKAM- 
Protein G 

Lane 3: flow through from S189AL loaded at 1 ml/niin(10|il fraction 2) 
Lane 4: flow through from S190 AL loaded at 1 ml/min (10|il fraction 2) 
Lane 5: elution from S189 AL after 15 hours (10|il fraction 2, 8 \ig of protein 
G) 

Lane 6: elution from S190 AL after 15 hours (10|il fraction, 4.8 jig of protein 
G) 

Lane 7: strip from S189 AL (IOmI fraction 7, BjAg pR8FKAM) 

Lane 8: strip fromS190 AL (10^1 iSaction 7, 9^gpR8 FKAM) 

Lane 9: strip from SI 89 AL after -10 minutes (lOjj-l fraction 6, 6.4 p.g 671 

FKAM) 

Lane 10: Gb standard 

Figure 8 shows purification results of a-subunit bovine transducin wherein the blot lanes are 
assigned as follows: 

Lane 1: cell lysate - lO^il of 10 ml from 250 ml culture of 671 pr8FKAM- 

ChiT 

Lanes 2-3: Column wash 

Lanes 4-9: elution from SI 89 AL after 15 hours 

Lane 10: pooled fractions 

Figure 9 shows purification results of M. Thermautotrophicus CDC6 wherein the blot lanes are 
assigned as follows: 

Lane 1: Molecular weight standards - 2|j.g band 

Lane 2: cell lysate - lOp.1 of 50 ml from 750 ml culture of pr8FKAM-GDC6 
Lane 3: flow through from SI 89 AL_10 colunm loaded at 10 ml/min 
Lanes 4-8: elution from S189 AL after 15 hours (lOp-l fractions 2-6. 
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Figures 10 A and B shows the HSQC spectra of (a) protein G311 and (b) protein A219 
annotated with residue specific backbone assignments. The two proteins are 59% identical in 
sequence but represent different protein folds by NMR. 

Figure 1 1 shows the results of separation process of 56 amino acid Gb from 671 fusion protein 
(pR58FKAM-Gb) on S189HiTrap NHS column when the fusion protein is bound and washed 
as in the normal procedure. 

Figure 12 shows tiie results when the release of the target protein is triggered by the addition 
of fluoride ions that decreases the time required for purification of the target protein. 

Figure 13 shows the results when the prodomain (pR58) is stripped from the column in O.IM 
H3PO4 as in the normal procedure. 

Figure 14 shows purification results of Streptococcal protein Gb when triggered by the 
addition of KF, wherein the blot lanes are assigned as follows: 

Lane 1: Molecular weight standards ~ 2|ig band 

Lane 2: BL21 DE3 cell lysate - lOpl of 50 ml from 1 L culture of 671 
(pR58FKAM-Gb) 

Injected 1 ml of lysate on S189HT1 column: 
Lane 3: flow through loaded at 1 ml/min (10|il of 2 ml fraction 2) 
Lane 4: flow through loaded at 1 ml/min (10|il of 2 ml fraction 3) 
Lane 5: cleavage/elutionby O.IMKF. (lO^lofl ml fraction l;-7jig total) 
Lane 6: cleavage/elutionby O.IMKF. (10^1 of 1 ml fraction 2; -3 jig total) 
Lane 7: strip by 0.1 M H3PO4 (lOfil of 1 ml fraction 1; -10 ^g total in both 
bands combined). 

Notes: 

1) Coomassie staining of Gb is much weaker than for the pR58 fusion domain. 
Protein concentration was determined by A280 

2) Cleavage reaction was 90% con:q)lete using this cleavage/elution protocol. , 

DETAILED DESCRIPTIGN OF THE INVENTION 

The present invention relates to a prodomain comprising an optimized cognate sequence for 
binding to a highly specific processing subtilisin protease, wherein the pair has particular 
utility for protein purification. 
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The isolated subtilisin prodomain is unfolded but assumes a compact structure with a foxir- 
straiided anti parallel P-sheet and two three-turn a-helices in complex with subtilisin [130,133] 
(Figure 1). The C-terminal residues extend out from the central part of the prodomain and 
bind in a substrate-like manner along SBT's active site cleft. Residues Y77, A76, H75 and 
A74 of the prodomain become PI to P4 substrate amino acids, respectively. These residues 
conform to subtilisin' s natural sequence preferences. The folded prodomain has shape 
complementary and high affinity to native subtilisin mediated by both the substrate 
interactions of the C-terminal tail and a hydrophobic interface provided by the P- dieet [133]. 
The native tertiary structure of the prodomain is required for maximal binding to subtilisin. If 
mutations are introduced in regions of the prodomain, which do not directly contact subtilisin, 
their effects on binding to subtilisin are linked to whether or not they stabilize the native 
conformation. Therefore mutations which stabilize independent folding of the prodomain 
increase its binding affinity [137]. 

As used herein, the term "mutation" refers to an alteration in a gene sequence and/or an amino 
acid sequence produced by those gene sequences. Mutations include deletions, substitutions, 
and additions of amino acid residues to the wild-type protein sequence. 

As used herein, the term "wild-type" refers to a protein, herein specifically a protease or 
prodomain, produced by unmutated organisms. Wild-type subtilisin-Iike proteases are 
produced by, for example. Bacillus alcalophilus, Bacillus amyloliquefaciens, Bacillus 
amylosaccharicus. Bacillus licheniformis, Bacillus lentus, and Bacillus subtilis 
microorganisms. 

The term "variant" as used herein is defined as a protein in which the amino acid sequence, or 
other feature of a naturally occurring molecule has been modified and is intended to include 
mutants. Some of the variants falling within this invention possess amino acid substitutions 
deletions, and/or insertions provided that the final construct possesses the desired binding 
affinity between the protease prodomain and the corresponding protease. Amino acid 
substitutions in the either the protease prodomain protein or the corresponding protease may be 
made on the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity 
and/or the amphipathic nature of the residues involved. For example, negatively charged 
amino acids include aspartic acid and glutamic acid; positively charged amino acids include 
lysine and arginine; amino acids with uncharged polar head groups or nonpolar head groups 
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having similar hydrophilicity values include the following: leucine, isoleucine, valine; glycine, 
alanine; asparagine, glutamine; serine, threonine; phenylalanine, tyrosine. Also included 
within the definition of variant are those proteins having additional anaino acids at one or more 
sites of the C-terminal, N-terminal, as long as the variant retains the binding affinity. 

5 

The variants of the present invention may include subtilisin-like proteases. As used herein, the 
term "subtilisin-like protease" means a protease which has at least 25%, and preferably 80%, 
and more preferably 90% amino acid sequence identity with the sequences of subtilisin and 
maintaining at least the same functional activity of the wild-type protease. 

10 

The present invention is directed to the identification of a protease prodomain that is capable 
of binding a corresponding protease with high affinity. The protease prodomain of the present 
invention is fused to a second protein to form a protease prodomain fusion protein. The 
presence of a protease prodomain protein in a fusion protein allows for easy and selective 
1 s purification of the second protein by incubation with the corresponding protease. 

Examples of a second protein include, but are not limited to protein A, including 
staphylococcal Protein Ab domain and Protein As mutant A219; protein G including 
Streptococcal protein Gb domain. Streptococcal protein Ga domain and Protein Gb mutant 

20 G311; E. coli hypothetical Yab; Bovine a-subunit of transducin; M. thermautotrophicus 
CDC6; streptavidin; avidin; Taq polymerase and other polymerases; alkaline phosphatase; 
RNase; DNase; various restriction enzymes; peroxidases; glucanases such as endo-l,4-beta 
glucanase, endo-l,3-beta-glucanase; chitinases, and others; beta and alfa glucosidases; beta 
and alpha glucoronidases; amylase; transferases such as glucosyl-transferases, phospho- 

25 transferases, chloramphenicol-acetyl-transferase; beta-lactamase and other antibiotic 
modifying and degrading enzymes; luciferase; esterases; lipases; proteases; bacteriocines; 
antibiotics; enzyme inhibitors; different growth factors; hormones; receptors; membranal 
proteins; nuclear proteins; transcriptional and translational factors and nucleic acid modifying 
enzymes. 

30 

The term "protease prodomain protein" refers to prodomain amino acid sequence or functional 
equivalent thereof wherein the protease prodomain protein possesses flie capability of binding 
to a coiresponding protease with high affinity. Preferably, the prodomain is substantially firee 
of other proteins with which it is naturally associated, for instance, the balance of the protease 
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protein. In addition, one or more predetermined amino acid residues in the prodomain may be 
substituted, inserted, or deleted, for example, to produce a prodomain protein having improved 
biological properties, or to vary binding and expression levels. Through the use of 
recombinant DNA technology, the prodomain proteins of the present invention having residue 
deletions, substitutions and/or insertions may be prepared by altering the underlying nucleic 
acid. 

In one embodiment the protease prodomain protein may be fused to an antibody or an 
antigenic determinant as a second protein to form a protease prodomain fusion protein that is 
useful in diagnostic kits and in immunoassays. Thus, for exanq>le, bodily fluids can be tested 
for the presence of particular antibodies by making use of a protease prodomain and an 
antigenic epitope as a second protein fiised to the protease prodomain protein. Conversely, an 
antigen or antigenic portions thereof can be detected using a protease prodomain and antibody 
fusion protein. 

The term "fusion protein" as used herein refers to the joining together of at least two proteins, 
a prodomain protein, preferably being a protease prodomain and a second protein. 
Additionally, the fusion product of the present invention comprises an enzymatic cleavage site 
positioned between the protease prodomain and the second protein. The cleavage site if 
preferably adjacent to the N-terminus of the second protein thereby providing a means for 
recovering the second protein from the fusion product. 

In another embodiment of the invention, the fusion protein is a recombinant fusion product. A 
"recombinant fusion product" is one that has been produced in a host cell that has been 
transformed or transfected with nucleic acid encoding the fusion product, or produces the 
fusion protein as a result of homologous recombination. "Transformation" and "transfection" 
are used interchangeably to refer to the process of introducing nucleic acid into a cell. 
Following transformation or transfection, the nucleic acid may integrate into the host cell 
genome, or may exist as an extrachromosomal element. The "host cell" includes a cell .in in 
viti'o cell culture as well as a cell within a host organism. 

"Nucleic acid" refers to a nucleotide sequence comprising a series of nucleic acids in a 5' to 3* 
phosphodiester linkage that may be either an RNA or a DNA sequence. If the nucleic acid is 
DNA, the nucleotide sequence is either single or double stranded. The prodomain protease 
protein encoding nucleic acid is RNA or DNA that encodes a protein capable of binding the 



15 



wo 2005/017110 



PCT/US2004/021049 



corresponding protease with high affinity, is complementary to nucleic acid sequence 
encoding such protein, or hybridizes to nucleic acid sequence encoding such protein and 
remains stably boimd to it under stringent conditions. 

In constructing the fusion protein expression vector, the nucleic acid encoding the prodomain 
will be linked or joined to the nucleic acid encoding the second protein such that the open 
reading frame of the protease prodomain protein and the second protein is intact, allowing 
translation of the fusion protein product to occur. 

The nucleic acid encoding the prodomain protein of the present invention may be obtained 
from isolated and purified DNA from cell sources or by genomic cloning. Either cDNA or 
genomic libraries of clones may be prepared using techniques well known in the art and may 
be screened for particular protease or protease prodomain encoding nucleic acid with 
nucleotide probes that are substantially complementary to any portion of the gene. 
Altematively, cDNA or genomic DNA may be used as tenqjlates for PGR cloning with 
suitable oligonucleotide primers. Full length clones, i.e., those containing the entire coding 
region of the desired protease prodomain protein may be selected for constructing expression 
vectors, or overlapping cDNAs can be ligated together to form a coniplete coding sequence. 
Altematively, a preferred protease prodomain encoding DNA may be synthesized in whole or 
in part by chemical synthesis using techniques deemed to be standard in the art. 

Methods for recombinant production of polypeptides are well known to those skilled in the art. 
Briefly, for example, host cells are transfected with a polynucleotide that encodes for a 
protease prodomain protein linked to a second protein of choice. Means of transforming or 
transfecting cells with exogenous polynucleotide such as DNA molecules are well known in 
the art and include techniques such as calcium-phosphate- or DEAE-dextran mediated 
transfection, protoplast fusion, electroporation, liposome mediated transfection, direct 
microinjection and adenovirus infection. 

The most widely used method is transfection mediated by either calcium phosphate or DEAE- 
dextran. Although the mechanism remains obscure, it is believed that the transfected DNA 
enters the cytoplasm of the cell by endocytosis and is transported to the nucleus. Depending 
on the cell type, up to 90% of a population of cultured cells can be transfected at any one time. 
Because of its high efficiency, transfection mediated by calcium phosphate or DEAE-dextran 
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is the method of choice for experiments that require transient expression of the foreign DNA in 
large numbers of cells. Calcium phosphate-mediated transfection is also used to establish cell 
lines that integrate copies of the foreign DNA, which are usually arranged in head-to-tail 
tandem arrays into the host cell genome. 

The application of brief, high-voltage electric pulses to a variety of mammalian and plant cells 
leads to the formation of nanometer-sized pores in the plasma membrane. DNA is taken 
directly into the cell cytoplasm either through these pores or as a consequence of the 
redistribution of membrane connponents that accompanies closure of the pores. 
Electroporation can be extremely efficient and can be used both for transient expression of 
cloned genes and for establishment of cell lines that cany integrated copies of the gene of 
interest. Electroporation, in contrast to calcium phosphate-mediated transfection and 
protoplast fusion, frequently gives rise to cell lines that carry one, or at most a few, integrated 
copies of the foreign DNA. 

Following transfection, the cell is maintained under culture conditions for a period of time 
sufiScient for expression of the fiision protein of the present invention. Culture conditions are 
well known in the art and include ionic composition and concentration, temperature, pH and 
the like. Typically, transfected cells are maintained under culture conditions in a culture 
medium. Suitable medium for various cell types are well known in the art. In a preferred 
embodiment, temperature is from about 20 °C to about 50 °C. pH is preferably from about a 
value of 6.0 to a value of about 8.0. Other biological conditions needed for transfection and 
expression of an encoded protein are well known in the art. 

Transfected cells are maintained for a period of time sufficient for expression of the fusion 
protein and typically, maintenance time is from about 2 to about 14 days. When using 
recombinant techniques, the fusion protein can be produced intracellularly, in the periplasmic 
space, or directly secreted into the medium. If the polypeptide is produced intracellularly, as a 
first step, the particulate debris, eittier host cells or lysed cells (e.g. resulting from 
homogenization), is removed, for example, by centrifugation or ultrafiltration. 

To direct a protease prodomain fusion protein of the present invention into the secretory 
pathway of the host cells, a secretory signal sequence (also known as a leader sequence or pre 
sequence) is usually required. In the present invention flie prodomain sequence of the protease 
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is part of the fusion protein and thus secretion of the fusion protein is easily effected by 
including a signal sequence such as that defined in SEQ DD NO. 2. 

Thus, the recombinant fusion protein is recovered or collected either from the transfected cells 
5 or the medium in which those cells are cultured. The fiision protein is then subjected to one or 
more purification steps. In one embodiment of the invention, the recovery step involves 
exposing a composition comprising the fusion protein to a solid phase &at has immobilized 
thereon subtilisin or a variant thereof that binds with the prodomain protein with high afiBnity 
to form a protease/protease prodomain binding complex. The solid phase may be packed in a 
10 column and the immobilized corresponding protease captures the fusion protein and 
chemically and/or physically modifies the fusion protein to release the second protein. 

By "solid phase" is meant a matrix comprising a protease to which a fusion product can 
adhere. The solid phase may be a purification colimm, a discontinuous phase of discrete 

15 particles, a membrane or filter. Examples of materials for forming the solid phase include 
polysaccharides (such as agarose and cellulose); and other mechanically stable matrices such 
as silica (e.g. controlled pore glass), poly(styrenedivinyl)benzene, polyacrylamide, ceramic 
particles and derivatives of any of the above. In preferred embodiments, the solid phase 
comprises controlled pore glass beads retained in a colimin that is coated with a protease for 

20 binding with high affinity for the prodomain protein of tiie fusion protein product. 

The phrase "binding with high aflSnity" as used herein refers to the ability of the protease 
prodomain to bind to the cognate protease with a Kd of nM to pM and ranging from about 10 
nM to about 10 pM, preferably < 100 pM. 

25 

This invention also relates to diagnostic detection of proteins of interest in test samples, 
especially in biological samples, such as tissue extracts or biological fluids, such as serum or 
urine through use of the fusion protein of the present invention. The biological samples are 
preferably of mammalian origin and most preferably of human origin. In one embodiment of 
30 the present invention, the fusion protein may comprise an antibody which is used to detect the 
presence of an antigen in biological samples using a variety of immunoassay formats well 
known in the art. Alternatively, the second protein of the fusion protein is comprised of an 
antigenic epitope useful in the detection of antibodies that recognize the antigenic determinant. 
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The "antibody" as used herein is meant to include polyclonal antibodies, monoclonal 
antibodies (MAbs), humanized or chimeric antibodies, single chain antibodies, anti-idiotypic 
(anti-Id) antibodies, and epitope-binding fragments of any of the above. 

5 The term "detectable label" as used herein refers to any label which provides directly or 
indirectly a detectable signal and includes, for example, en2ymes, radiolabelled molecules, 
fluoresors, particles, chemiluminesors, enzyme substrates or cofactors, enzyme inhibitors, 
magnetic particles. Examples of enzymes useful as detectable labels in the present invention 
include alkaline phosphatase and horse radish peroxidase. A variety of methods are available 

10 for linking the detectable labels to proteins of interest and include for exanq>le the use of a 
bifunctional agent, such as 4,4'-difluoro-3,3'-dinitro-phenylsulfone, for attaching an enzyme, 
for example, horse radish peroxidase, to a protein of interest. The attached detectable label is 
then allowed to react with a substrate yielding a reaction product which is detectable. 
Also falling within the scope of the present invention is the therapeutic or diagnostic use of a 

15 protease prodomain fusion product wherein the second protein is a monoclonal antibody 
having affinity for an antigenic epitope. For example, a protease prodomain fusion product 
comprising (i) a protease prodomain capable of binding to a cognate protease with high 
affinity, and (ii) a monoclonal antibody capable of binding antigen can be used in a method to 
target a drug/protease complex or imaging agent/protease complex to a cancer cell producing 

20 the antigen. In this embodiment, a protease prodomain linked to a second protein (monoclonal 
antibody) is administered to a mammal. Either concurrently with or following the 
administration of the fusion product, a drug/protease or an imaging agent/protease complex is 
administered. Binding of the drug/protease or imaging agent/protease complex to the protease 
prodomain fusion product localized at the site of the antigen directs and targets the drug or 

25 imaging agent to the relevant site for the desired therapeutic or diagnostic activity. 

The invention is further illustrated in the following examples, which are not intended to be in 
any way limiting to the scope of the invention as claimed. 

30 Methods and Materials 

Selection of Mutations. Cloning and Expression 

The specific point mutations set forth in the present application identify the particular amino 
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acids in the subtilisin BPIST amino acid sequence, as set forth in SEQUENCE ID NO: 3 (Figure 
2), that are mutated in accordance with the present invention. For example, the S149 mutant 
comprises a deletion of amino acids 75-83 and additionally comprises the following 
substitution mutations: Q2K, S3C, P5S, S9A, BIL, K43N, M50F, A73L, E156S, G166S. 
GI69A, S188P, Q206C, N212G, K217L, N218S, T254A and Q271E. Additional mutated 
variants are set forth in Table 1 as shown in Figure 3. 

The subtilisin gene from Bacillus amyloliquefiiciens (subtilisin BPN^ had been cloned, 
sequenced, and expressed at high levels from its natural promoter sequences in Bacillus 
subtihs [9, 10]. All mutant genes were recloned into a pUB HQ-based expression plasmid and 
used to transform B. subtilis. The B. subtilis strain used as the host contains a chromosomal 
deletion of its subtilisin gene and therefore produces no background wild type (wt) activity 
(Fahnestock et al., Appl. Environ. Microbial. 53:379-384 (1987)). Oligonucleotide 
mutagenesis was carried out as previously described. [17]. 

Wild type subtilisin and the variant enzymes were purified and verified for homogeneity 
essentially as described in Bryan et al., [17, 94 and 95], In some cases the C221 mutant 
subtilisins were re-purified on a sulflhydryl specific mercury affinity column (Affi-gel 501, 
Biorad). 

Cloning and Expression of the Prodomain of Subtilisin 

The prodomain region of the subtilisin BPN* gene was subcloned using the polymerase chain 
reaction as described in Strausberg, et al. [138], Mutagenesis of the cloned prodomain gene 
was performed according to the oligonucleotide-directed in vitro mutagenesis system, version 
2 (Amersham International pic) 

Example 1 

To demonstrate the feasibility of prodomain-directed processing, a gene was constructed to 
direct the S3mthesis of a fusion of the pR8 prodomain onto the N-terminus of the 56 amino acid 
B domain (Gb) of streptococcal Protein G. Prodomain pR8, having the mutations at amino 
acid residues 16-21 (QTMSTM) which were replaced with SGDC creating a two amino acid 
deletion in pR8, wherein S replaces Q16, G replaces T17, Ml 81 replaces S19 and T20 and "K" 
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replaces M21; along with additional substitutions A23C, K27Q, V37L, Q40C, H72K and 
H75K is independently stable and binds to subtilisin with 100-times higher affinity than the 
wild type prodomain. Further, pR8 thus becomes the cognate sequence specifying the 
subtilisin cleavage site. 

The fusion protein (IjxM) was mixed wifli 1 |iM of wild type subtilisin. The fusion protein 
was rapidly and specifically cleaved to release Gb firom pR8, From the results several relevant 
observations were made including that: 1) The processing is a single turn-over reaction with 
strong product inhibition by pR8 at the end of a cycle; 2) The rate of a single cycle of 
cleavage is limited by the substrate binding rate (le ^ M^'s'^); and 3) Processing is highly 
specific because Gb is quite resistant to subtilisin activity. 

Example 2 

Mutations to decrease subtilisin activity against non-cognate sequences. 

Using pR8 to direct cleavage in and of itself does not create a optimal processing system 
because of subtilisin's high activity against non-cognate sequences. The next step was to 
engineer subtilisin to be less active against non-cognate sequences. The starting point for 
engineering a processing subtilisin was a mutant denoted S149 : (Q2K, S3C, P5S, K43N, 
A73L, 75-83, E156S, G166S, G169A, S188P, Q206C, N212G, K217L, N218S, T254A and 
Q271E). S149 previously was engineered for high stability and ability to fold independently 
of the prodomain. These characteristics, while not essential, are highly desirable in a 
processing enzyme. 

First, the mutations G128S and Y104A were introduced in S149 (denoted S160) to enlarge the 
S4 pocket [48, 51]. The catalytic properties of S149 and S160 were analyzed against two 
fluorogenic substrates, sDVRAF-AMC and sDFRAM-AMC, using transient state Idnetic 
methods. The enlarged S4 pocket in SI 60 coupled a pre-existing preference for M over F at 
the PI position resulted in a 100-fold preference of sDFRAM-AMC (Ks = 0.8|iM) over 
sDVRAF-AMC (Ks = 83|iM). In comparison S149 prefers sDFRAM-AMC (Ks = l|iM) by 
five fold over sDVRAF-AMC (Ks = 5|iM). Thus, a modified subtilisin could be engineered to 
increase preference for cognate sequences. 
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Example 3 

A version of pR8 was constructed with its last four amino acids (AHAY) replaced witit FRAM 
(denoted pR58). pR58 inhibits Sl60 wifli a Ki of - 30 pM. An N-tenmnal fusion of pR58 
onto the Gb domain was found to bmd to S160 with a substrate affinily (Ks) in the pM range, 
5 at least le5-times greater than even the highly preferred pentapeptide substrate sDFRAM- 
AMC. Essentially the prodomain structure acts as an amplifier of the PI and P4 sequence 
signals. Hydrolysis is limited to a single turn-over by strong product inhibition. Product 
inhibition is difficult to avoid in using high substrate affinity to direct specific cleavage 
because of the structural similarity between substrate and product. We fterefore do not 
10 attempt to obviate this property. As will be described later, the single turn-over reaction can 
be exploited in applying the system to protein purification. 

A modified version of SI 60 with S166G was also constructed (denoted SI 93). The mutant 
prefers F and Y as P4 and PI amino acids, respectively. 

15 

The preferential binding of SI 60 to pR58-fusions relative to non-cognate sequences does not 
results in highly specific cleavage. The reason for this can be discerned by considering the 
following mechanism for a single catalytic cycle: 

20 ki k2 

E+S -> ES -> EX + P 

< — 

25 The rate of the release of product dP/dt = k2ki[S]/(ki[S] + k.i + ka). 

In the reaction of sDFRAM-AMC with S160, the substrate ofiTrate (k.i) is ~10s"^ compared to 
an acylation rate (ka) of 100s"^ In the reaction of pR58-Gb, the acylation rate is similar but k.i 
is five orders of magnitude smaller (1x10"^ s"^). The ka term in the denominator of the rate 

30 equation is > 10-times larger than the k.i term in both cases, however. Thus, k-i has little 
influence on the observed rate of product formation. Substrate affinity would become 
increasingly important, however, if the acylation rate were slow enough that equilibriiun 
between enzyme and substrate were approximated. Slowing k2 was accomplished with 
mutations in the catalytic amino acids (D32 in S190), (S221 in SI 94) and the oxyanion hole 

35 amino acid (N155 in S 1 88) (see Table 1 in Figure 3). 
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Mutations in the active site nucleophile S221 A. 

Mutation of the active site serine nucleophile in S160 creates a mutant (S194) which binds 
pR58 fusion proteins with 10 pM affinity. The rate of binding is rapid (-1 xlO^ M s'^), but 
SI 94 cleaves the fusion protein very slowly ( < 100 hr"^). However, the mutant is useful for 
affinity purification of uncleaved fusion proteins. 

Mutations in the oxyanion hole: N155L, N15SQ. 

Removal of the hydrogen bond which stabilizes the oxyanion of the transition state decreases 
the rate of the acylation reaction (ka) by -lOOO-fold. Processing of tiie pR58-GB fusion 
protein by N155 (S188 and S191) mutants is a slow, single turn-over reaction. After the single 
roimd of cleavage pRS8 remains tightly bound to the enzyme. As explained above, this 
reduction in ka creates a large degree of sequence discrimination base on differential substrate 
binding. 

Mutations in the Asp-His couple: Creation of an anion switch. 

Of particular usefulness were the mutations of D32. The carboxylate of D32 hydrogen bonds 
to the catalytic H64 and allows it to act as first a general base and then a general acid during 
acylation. Mutation of the catalytic Asp in trypsin created a drastic decrease in activity around 
neutral pH but a strongly hydroxide dependant alternative mechanism evident above pH 10 
[196,197]. The potential of creating a two stage reaction consisting of a binding step followed 
by a chemically-triggered cleavage step led to focus on mutations at D32. Consequently D32 
was mutated to A, S, V, G, and T in SI 60 and SI 93. The sequence specificity of D32 mutants 
is extremely high with kcat/Km 10 M"^ s** vs. sFRAM-AMC. The high specificity was also 
manifested by their inability to process pRS-Gs and also ttieir inability to autoprocess in vivo 
unless the P4 residue of the pro-sequence was mutated fi-om A to F. 



The kinetics of fusion protein pR58-Gb cleavage are shown in Table 2: 



mutation 


D32A (S189) 


D32V (S196) 


D32S (S190) 
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Rate (hr-') 


0.18 


0.3 


1.4 


Reaction in 0. IM KPi, pH 


7.2,23 C 



Example 4 

Particularly advantageous are processing proteases whose activity is triggered on demand. 
Ions which have been useful as triggers are OH- (pH), CI- and F-. The tables summarize 
cleavage rates of various D32 mutants as a function of specific anion. 

Rates vs. pH. for S189 and S190 



pH 


5.7 


7.2 


8.8 


10.0 


S189hr-' 


0.135 


0.18 


0.97 


4 


SI 90 hr-' 


0.18 


1 


5 


25 



Reaction in 0. IM KPi, , 23 C 



Rates vs. [Cl].ForS189 



[Cl] 


OM 


0.5 M 


S189hr-' 


0.97 


5 


Reaction in 0.1^4 


[BCPi,pH8.8, 23 


C 



Rates VS. [F]. For SI 89 



[F] 


OmM 


ImM 


lOmM 


lOOmM 


S189min-' 


0.003 


0.018 


0.14 


0.8 


Reaction in O.IA^ 


[KPi,pH7.2,23 


C 







As shown in Figure 4, the rate of activation is proportional to the concentration of the ions. 
Thus, SI 89 can be trigger to increase cleavage rates if desired and this can be very 
advantageous when required in a purification process. Once the fusion protein is bonded to 
the subtiUsin variant to form a binding complex, the target protein can be cleaved from the 
prodomain protein by activation of the subtiUsin variant with the introduction of an activating 
ion solution. 

Example 5 
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Truncation of the prodomain 

The prodomain of subtilisin can be replaced with a much shorter cognate sequence which has 
been selected for optimized binding with the processing protease. The amino acids comprising 
variations of only the C-terminal part of the prodomain (E E D K L (F/Y) Q S (M/ITY) can be 
used as a cognate sequence. For example, it has been shown that the IgG binding domain of 
Streptoccoccal Protein G, which has no natural affinity to subtilisin, binds to S194 with a sub- 
micromolar dissociation constant once a nine amino acid C-tenninal tail has been added. 

Example 6 

Immobilization of processing subtilisins for afSnity purification and processing. 

The binding and catalytic properties of processing subtilisin allows them to be used as both the 
affinity n:iatrix and processing protease for purification of proteins tagged with the pR58 
sequence. To demonstrate this point, SI 89 was immobilized on a chromatography resin. 

An E, coli cell lysate containing pR58-Gb was passed over the matrix containing immobilized 
SI 89. The fusion protein bound rapidly to the SI 89 matrix while the impurities were washed 
through the matrix as shown in Figure 5. Cleavage of the bound fusion protein then was then 
effected either by addition of a triggering anion (e.g. lOmM KF) or by extended incubation 
(e.g. 1 8 hours at pH 7.2) as shown in Figure 6. After cleavage the pure, processed protein was 
washed off the matrix while the cognate prodomain remains tightly bound to subtilisin on the 
matrix. Multiple rounds of purification can be affected by stripping the pR58 from the S189 
column at pH 2.1 and re-equilibrating the column at neutral pH. High stable and facile-folding 
mutants such as those listed in the Table 1 (Figure 3) of Processing SubtiHsin are required for 
column recycling. 

Eight different fusion proteins comprising pR58 and target proteins were purified and 
recovered in good yield by complexing the fusion protein with subtilisin SI 89 or S190, 
including: 

Streptococcal protein Gb domain 56 aa 
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Streptococcal protein Ga domain 45 aa 

Protein Gb mutant G3 11 56 aa 
Staphylococcal Protein Ab domain 56 aa 

Protein Ab mutant A219 56 aa 

5 E. coli hypothetical Yab 1 1 7 aa 

Bovine a-subunit of transducin 350 aa 

M. thermautotrophicus CDC6 379 aa 



As shown in Figure 7, the fusion protein comprising pRS8 (pRSFRAM) linked to Streptococcal protein 
10 Gb domain was complexed and separated on both S189 and 8190 immobilized beds. Lanes 3 and 4 
show tiiat multiple components of different molecular weights are washed through the system. After a 
sufficient incubation period, the fraction of output is limited to protein G, evidenced by the molecular 
weight fraction shown in lanes 5, 6, 7 and 8 relative to the molecular weight of protein G identified in 
lane 10. 

15 

The results of the purification of P subunit bovine transducin (350aa) are shown in Figure 8. As 
evidence by the elution shown in lanes 4-9, tiie target protein is eluted from the column after sufficient 
time for the cleaving the bond between the prodomain protein and the target protein by the activity of 
thesubtilisinS189. 

20 

The results of purification of CDC6 (379 aa) are shown in Figure 9. The fiision protein comprising 
pR58 (pRSFRAM) linked to M.thermautotrophicus CDC6 was complexed and separated on SI 89 
immobilized beds. Lane 2 shows that multiple components of different molecular weights are washed 
through the system in the early period of separation. After a sufficient incubation period, the fraction 
25 of output is limited to CDC6, as evidenced by the molecular weight fraction shown in lanes 4-8. 

Figures 10 A and B show the ^^N HSQC spectra of (a) protein G311 and (b) protein A219 that were 
purified on a S189AL_10 column and recovered therefrom. The two proteins are 59% identical in 
sequence but represent different protein folds. 

30 

Example 7 

Further purification experiments were conducted on the 56 amino acid Streptococcal protein Gb domain 
linked to pR58 (pRSFRAM) wherein the 671 fusion protein (pR58FKAM-Gb) was purified and 
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separated on SI 89 HiTrap NHS column by continuous injection of O.IM KF to demonstrate the 
effectiveness of the release of a target protein when mutant subtilisin was triggered by fluoride ions. 
Figure 11 shows the results when the fusion protein is boimd and washed as in the normal procedure. 
Figure 12 show that the addition of 100 mM potassium fluoride injected at O.lml/min causes the rapid 
cleavage as the fluoride ions come in contact with the bound fusion protein to release of the target 
protein so that it is concentrated as it is washed off the column. Figure 13 shows that the stripping of 
the prodomain (pR58) from the column in O.IM H3PO4 as in the normal procedure. These results show 
that flie release of the target protein can be adjusted by the use of obtain ions as triggers (OH- (pH), Gl- 
and F) to initiate the protease activity of the mutant subtilisins. 

Figure 14 shows the separation of the fusion protein coniprising pR58 (pRSFRAM) linked to 
Streptococcal protein Gb domain on an SI 89 immobilized beds. Lane 1 is the molecular weight 
standards. Lanes 2 and 4 show that multiple components of different molecular weights as washed 
through the system. After the addition of O.IM KF the fraction of output is limited to protein G^, 
evidenced by the molecular weigjit fraction shown in lanes 5 and 6. 
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